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No. 09-0460. 

I. REAL PARTY IN INTEREST 

The real party in interest is International Business Machines Corporation, the assignee of the 

present application. 



Dkc ST9-99-130 

(C&C 30571 .179-US-Ol) 



PA6E4f96*RCVDAT8f3l20044;16:57PMpstemDayligMTime]*SVR:USPTO.EFXRF.1i2*DNIS:8729306*CSID:+^ 



08-03-2004 i 2 :27PM FROM-Gates & Cooper LLP 



+13f06418798 



T-958 P. 005 



F-096 



\ 

n. T^KT.ATP.n APPRAIS AN n TlsITF^RFFRENCES 

There are no related appeals or interfeieaces for die above-refereaccd patent application. 

HI. grATTT^ OF CLAIMS 

Qaims 1-57 ate pending in the application. 

Claims 1, 20, and 39 were rejected under 35 U.S.C. §l03(a) as being unpatentable over Bayer 
et aL, U.S. Patent No. 5;202,987 (Bayer) in view of Tsuchida et al. U.S. Patent No. 6,026,394 
(Tsuchida), 

Claims 1, 20, and 39 were rejected vmdcr 35 U.S.C. §l03(a) as being unpatentable over 
Bhattacharya et aL, U.S, Patent No. 5,797,000 (Bhattacliarya), 

Claims 2-3, 21-22, and 40-41 were rejected under 35 U.S.C §l03(a) as being unpatentable 
over Bhattacharya as applied to daims 1, 20, and 39, in view of Jlintz et aL, U.S. Patent No. 
5,222,235 (Hintz). 

Claims 4-6, 23-25, and 42-44 were rejected under 35 U,S,C. §l03(a) as being unpatentable 
over Bhattacharya as applitid to ckiraa 1, 20, and 39 in view of Bordonaro et al., U.S. Patent No, 
5,307,485 (Bordonaro). 

Claims 7-11, 26-30, and 45-49 were rejected under 35 U.S-C 5103(a) as being unpatentable 
over Bhattacharya as applied to claims 1, 20, and 39 in view of "Office Notice" (ON). 

Claims 12-19, 31-38, and 50-57 were indicated as being allowable if lewtitten in independent 
form to include the base claim and any intervening claims, 

rv. STATUS OF AMENDMENTS 

Subsequent to the final rejection, no claims have been cancelled, ataendcd, or added. 

V. SUMMARY OF THE INVENTION 

Appellants' invention, as recited in independent daims 1, 20 and 39 are directed to loading 
data into a data store connected to a computer. Independent claim 1 is representative and comprises 
the steps o£ 
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identifying memory constraints; 
identifyisig processing capabiliucs; and 

determining a number of load and sort processes to be started in parallel based on die identified 
memory conscraints and processing capabiKries, 

With regard to die claims, refer to the specification as foUo^cs: 

(a) at page 2, lines 1 4-23j 

(b) at page 3, line 26 through page 6, line 24, and in FIGS, 1-2; and 

(c) at page 7, Enc 1 duough page 14, line 23, and in FIGS, 3, 4, 5A-B, GA-B and 7A-B. 

VI. TSSIIFi; PRESENTED FOR REVIEW 

1. Whether daims 1. 20, and 39 are obvious under 35 U.S.C §l03(a) over Bayer et aL, 
U.S. Patent No. 5,202,987 (Bayer) in vie^xr of Tsuchida et aL, U.S. Patent No. 6,026,394 (Tsuchida). 

2. Whedier daims 1, 20, and 39 are obvious under 35 U,S.C. §l03(a) over Bhattacharya 
et aL, U.S. Patent No- 5,797,000 (Bhattacharya). 

3. Whedier claims 2-3, 21-22, and 40^41 are obvious under 35 U,S.C. §l03(a) over 
Bhattacharya as applied to daims 1, 20, and 39, in view ofHintz et al, U.S. Patent No. 5^22^35 
(Hiniz). 

4. Whedier claims 4-6, 23-25, and 42-44 are obvious undct 35 U.S.C. §103(a) over 
Bhattacharya as applied to daims 1, 20, and 39 in view of Bordonaro et al., U.S. Patent No. 
5,307,485 (Bordonaro), 

5. Whedier claims 7-11, 26-30, and 45-49 are obvious under 35 U.S.C. §103(a) over 
Bhattacharya as appUed to daims 1, 20, and 39 in view of an "Office Notice" (ON). 

viL CtRouping of claims 

The rejected r-i t^ir^ft do not stand or fall together. The daims are grouped as follows: 

(a) claims 1, 7-11, 20, 26-30, 39 and 45-49; 

(b) daims 2, 21 and 40; 

(c) daims 3, 22 and 41; 

(d) claims 4, 23 and 42; 
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(e) Haims 5, 24 and 43; and 
(£) daitns 6, 25 and 44. 
Separate aiguments for the patentabflity of each group of daims are provided below, 

Vin. ATtailMFNTS 

A, The Office ^ cpan Rqections 

In paiagtaphs (2)-(3) of the Office Action, claims 1, 20, and 39 wete rejected under 35 
U.S.C. §l03(a) as being unpatentable over Bayer et al., U^. Patent No. 5^02,987 (Bayer) in view of 
Tsuchida eu aL, U.S, Patent No. 6.026,394 (Tsuchida). In paragraph (5) of the Office Action, claims 
1, 20, and 39 were rejected under 35 U.S.C. §1 03(a) as being unpatentable over Bhatcacharya et aL, 
U.S. Patent No. 5.797,000 (Bhattacharya). In paragraph (7) of the Office Actbo, claims 2-3, 21-22, 
aod 40-41 were rejected under 35 U.S.C. §103(a) as bdng unpatentable over Bhattacharya as applied 
to claims 1. 20. and 39, in view of Hints 3t J., U.S. Patent No. 5,222.235 (Hintz). In paragraph (1 1) 
of the Office Action, claims 4-6, 23-25, and 42-44 were rejected under 35 U.S.C. §1 03(a) as bdng 
unpatentable over Bhati^charya as applied to daims 1, 20, and 39 in view of Bordonaro et al.. U.S. 
Patent No. 5,307.485 (Bordonaro). In paragraph (16) of the Office Action, daims 7-11. 26-30, and 
45-49 were rejected under 35 U.S.C. $l03(a) as being unpatentable over Bhattacharya as applied to 
daims 1, 20, and 39 in view of an "Office Nodce" (ON). However, in paragraph (18) of the Office 
Action, rUln.^ 12-19, 31-38. and 50-57 were indicated as being allowable if rewritten in independent 
form to inchide the base claim and any intervening claims. 

Appellants' attorney acknowledges die indication of allowable claims, but respectfuUy 
traverses the rejecrions. 

B, AppeUancs' Independent Claims. 

Appellants' independent claims 1, 20 and 39 are directed to loading data into a data store 
connected to a computer. Independent claim 1 is reptesentarive and comprises the steps of: 
identifying memory constraints; 
identifying processing capabilities; and 
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deiermkimg a number of load and sort processes to be started in paianel based on tiie idcatified 
memory constraints and processing capabilities. 

C, The Bayer Tlefermcfc 

Bayer describes a high flow-rate synchronizer/schednler apparatus for a multiprocessor 
system during program mn-rime, which comprises a connecdon matrix for monitoring and detecring 
computadonal tasks which ate allowed for execution containing a task map and a networit of nodes 
for distributing to the processors information or computational tasks detected to be enabled by the 
connection matrix. The network of nodes possesses the capability of decomposing information on a 
pack of allocated computational tasks into messages of finer sub-packs to be sent toward the 
processors, as well as the capability of unifying packs of information on termination of 
computational tasks into a mote comprehensive pack A method of performing the 
synchrooization/scheduling in a multiptocessor system of this apparams is also described. 

D. The Tsuchidft Reference 

T^hida describes a database management system for executing database operations in 
paiallcl by a plurality of nodes and a query processing metiaod. The database management system 
contains a decision managemenc node for deciding a distribution node for retrieving information 50 
as to analyze a query received &om an application program, generate a processing procedure for 
processing tlie query, and execute die process, and a join node for sorting, merging, and joining the 
information retrieved by die distribution node. When die query process is executed, die distribution 
node decided by the decision management node retrieves the information to be processed and the 
join node decided by the decision management node also obtains the result for the query from the 
retrieved information. The query result is outputted &om an output node and transferred co the 
application program. 

E. The BhattacharvQ Refetence 

Bhattacharya describes a method of performing a parallel join operation on a pair of 

xebtion$ Rl and R2 in a sysuem containing P processors organized into Q dusters of P/Q 
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processors each. The system contains disk storage for each cluster, shared by the processors of that 
duster, cogedier with a shared interraediatci tnemory (SIM) accessible by all proc(i3sots. The relations 
111 and R2 to be joined are first sorted on die join column. The underlying domain of the join 
colunm is dien partitioned into P ranges of equal size. Each range is further divided into M 
subranges of progressively decreasing size to create MP tasks T.sub.m,p, the subranges of a given 
range being so sized relative to one another that the estimated completion rime for task T.$ub.m^p is 
a predetermined fraction that of task T.sub.m-l,p. Task$ T.subjn,p with larger time estimates ate 
assigned (and the corresponding mples shaped) to die cluster to which processor p belongs, while 
tasks with smaller time estimates are assigned to the SIM, which is regarded as a universal chister 
(cluster 0). The actual task-to-processor assignments are determined dynamically during the join 
phase in accordance with the dynamic longest processing time first (DLPT) algotithm. Each 
processor widiin a cluster picks its next task at any given decision point to be the one with the 
largest time estimate which is owned by that cluster or by cluster 0. 

F. The Hintz_Reference 

Hintz describes a reorganization method of DB2 data files exploring parallel processing, and 
asynchronous I/O to a great extent. It includes means to estirnatc an optimum configuration of 
system resources, such as storage devices (DASD devices), memory, and CPUs, etc, during 
reorgann^ations. The method mainly consists of four components, (1) concuttent indexing, (2) 
concurrent unloading of data file partttions, (3) efficient reloading of DB2 data pages and DB2 space 
maps, and (4) means to reduce access constraints to the DB2 recovery table. 

G. The Bordonaro Reference 

Bordonaro describes a system and method for merging a plurality of sorted hsis using 
multiple processors having access to a common memory in which N sorted lists which may exceed 
the capacity of the cotmnon memory are merged in a parallel environment. Sorted hsts fi:om a 
storage device are loaded into common memory and are divided into a number of tasks equal to the 
number of available processors. The records assigned to each task are separately sorted and used to 
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foim a single sorted Ust. A m^ilti-pjcocessing eaviiornnenc rakes advantage of its organizatioa dmmg 
the creation of the tasks, as well as dunng the actual sorting of the tasks, 

H, Appellants^ Independent Claims Are P grentable Over The References 
Appellants' attorney respectfully submits that Appellants' claimed invention is patentable 
over the references. Specifically. AppeUanta' artotney asserts that die references do not teach or 
suggest the limitations recited in Appellants' independent claims 1, 20 and 39, 

With regard to the rejections hased on Bayer and Tsuchida, the Office Action states the 
following: 

3. Claims 1, 20» and 39 ate rejected under 35 U,SX. 103(a) as being 
unpatentable over Bayer et aL (US Pat 5,202,987, hereinafter Bayer) in view of 
Tsuchidft et al. (US Pat 6,026^94, hereinafter Tsuchida). 

4. Regarding claims 1, 20, and 39, Bayer et aL disclose a mediod of 
loading data ioto a data store connected to a computer^ the method comprising the 
steps of: 

identifying memory constraints (col. 1, lines 13-15, memory and processors 
are operations botdeneck and col 5, lines 52 - 56, memory is constrained or limited 
diroi^ factors such as physical shared storage, network acce$$ or processor 
distnburion, and common memory space); 

identifying pxocessiag capabilities (col. 1, lines 17-27, synchronization 
activities are controlled by algorithm, which depends on processing power and col. 5, 
Imes 24-31 requires the number of processors and capabihties of each processor, 
which entail processing capabilities); and 

determining a number ofload (coL 14, lines 25 - 33, loading capacity being 
part of task map) to be started in parallel based on die identified memory constraints 
and processing capabilities (coL 7, lines 9 -1)- 

Although Bayer disclose the sort process being a mere tasks allocation to the 
processors (coL 1 , lines 44 - 50), Tsuchida has neverdieless further detailed die sort 
feature, which iacludes the step of determining a number of sort processes (coL 8, 
lines 50 - 51 disclose the fact that the sorting process depends on the number of 
node for joLu process. CoL 7, hnes 54 - 57 show that the number of join nodes for 
performing merge process can he determined- Hence^ number of sort processes is a 
known quannty). 

It is considered obvious to one of ordinary skiH in the art, at the time the 
invention was made, to combine the sorting feature shown by Tsuchida to the 
invention of Bayer so that sort processing time, which is a factor in load baWciiig 
processes can be determined as part of system characteristics and optimisation 
purposes (Tsuchida: coL 7, lines 58 - coL 8, line 35). Note diat the sort steps shown 
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by Tsuchida are also parallel processes as claimed in the application (fig- 3, parallel 
pipeline operarion). 

The 0£&ce Action also stares the following: 
Response to Arguments 

19. AppUcatxt's argoments filed 12/30/03 have been ftOly considered but 
they are not persuasive for the reasons set forth below, 

20. In response to applicants' remarks, page 20, lines 1 - 3, and page 22, 
4th paragraph, regarding Bayer, Tsuchida and Bhattacharya references do not teach 
or suggest tiie limitation *'deterrmning a nximber of load and sort processes to be 
started parallel based on the identified memory constraints and processing 
capabilities", this has akeady been addressed as stated in die rejection above. 

Additionally, Tsuchida's fig. 2 teaches a plurality of processors implementing 
parallel operations in a database management system. Fig 3 teaches management 
node 12 determines the distribution process with the remeval data (loading process) 
by execution on the basis of number of load and sort process (col. 10, line 60 - coL 
11, line 16). Tsuchida also teaches of dynamic optimization process to produce an 
optimal result within the system processing capabihties by analyzing the processing 
procedvires that ate applicable based on the constraints (fig, llf, 12b) including 
calculating the processing time consideting each system characteristic (coL 3, lines 6 - 
47,coL 12, lines 9 -35). 

Appellants' attorney disagrees. The cited portions of diese references do not teach or 
suggest the hmitarion "detemrining a number of load and sort processes to be started in paraBd based 
on die identified memory constraints and processing capabilities." 

For example, the cited portions arc set fonh below: 

Bayer: CoL 1. lines 13-27 

The coordination of multiple operations in shared memory multiprocessors 
often constitutes a substantial perfonnance botdcneck. Process synchronization and 
scheduling are generally performed by software, and managed via shared memoiy. 
Execution of parallel programs on a shated-memory, speedup-orienied 
multiprocessor necessitates a means for synchronizing the activities of the individual 
processors. This necessity arises due to precedence constraints vtithin algorithms: 
When one computation is dependent upon the result of other computations, it must 
not commence before they finish. In die general case, such constcaints are projected 
onto an algorithm's parallel decomposition, and reflected as precedence relations 
among its execution threads. 

Bayer- Col. 5. lines 52-56 (actually. 45-56^ 
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Tlie general architecmral concepts described so far may be implemented in 
multiple alternative ways. The processors imy range &om minute processing 
elements to large scientific processors. They are not limited to any specific type, and 
are not confintid ro lihe von-Neumann model. They can also be compound 
processing units. The architecture may also be applied to non-homogeneous systems. 
The shared memory may consist of physically shared storage, possibly accessed 
through an interconnection network, or be distributed over the processors, as long as 
common memory space is preserved, at least m part 

Ba yer: Col. 5. lines 24-31 factuaUy. 24-44^ 

In addition to a task map, die synchronizer/ scheduler is supplied ^ith die 
system configuration datau This includes such details as the number of processors, 
the capabilities of each processor (if processors are not a-prioid identical), etc. 

Given a set of enabled tasks, as well as processor availabiHty data, the 
synchronizer/scheduler then performs scheduling of those tasks. Any non-random 
scheduling policy must rely upon some heuristics: Even when task execution times 
are known in advance, finding an optimal schedule for a program represented as a 
dependency graph is an NP-complete problem. Most scheduling heuristics are bases 
on the critical padi method, and thereby belong to the class of Ust scheduling 
policies; i,e-, policies that rely on a list of fixed task prioriries. List scheduling can be 
supported by the inventive scheme described herein, by embedding task priorities in 
the task map load-module submitted to the synchronizer/ scheduler. Whenever an 
allocation takes place, the allocated tasks are those which have highest priorities 
amongst the current selection of enabled tasks. 

BavgTT Col. 14, lines 25-33 
Charactericing Parameters 

The parameters characterizing a Specific synchronizer/ scheduler can now be 
summarized: 

Loading Capacity: 

The maximal size of a task map which can be loaded. This parameter is 
expressed in i-^t^n^ of quantity of taslcs, and/ or in terms of quantity of dependency 
connectLorus. 

Bayer: Col 7. lines 9-11 ractually, 9-13) 

The multiprocessor architecture is illustrated in FIG, 1. As can be seen, the 
parallel operation coordination subsystem (synchronizer/scheduler 10) forms an 
appendage to a conventional configucation of a shared-memory 12 and processors 
14. 

Bayer: Col, 1. lines 44-50 (actufllly, lines 28-51"^ 

Synchronization is only one aspect of a broad activity, which may be termed 
parallel operation coordination, whose other aspects are scheduling and ^ork 
allocation. Scheduling is selecting an execution order for the operations of a 
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program, out of a space of execution orders which are feasible undet the gjven 
architecmre and pxecedence constrainrs, as described in the paper enuUed **The^^ 
Effect of Operauon Scheduling on the Peiformaxice of a Data Flow Computer " M- 
Gransl^y et al, IEEE Trans, on Computers, Vol. C-36 No. 9, September 1987, pp. 
1019-1029. While scheduling deals with the point of view of the tasks to be 
computed, wodc allocation deals with the point of view of die piocessors which 
carry out die tasks. Thus, die disdncdoi^ between scheduling an allocation is not 
dear-cut, and some researchers use diese terms interchangeably. The decisive 
questions maybe posed as follows: 'Svhich ready-to-run piece of work should be 
executed first ?" which is a matter of scheduling policy; questions of tiie sort "to 
which processor should a given piece of work be allocated ?" or **how much work 
should be allocated at once to a given processor are considered to be a matter of 
allocation poUcy. Scheduling and allocation may be static, ie. determined before 
program run-rime, 

Tsuchida: Col- 8. lines 50-5 1 rflcmallv. 44-65^ 

FIG. 8 is a schematic view of the mning by die slot sott preprocessmg. Iq die 
same way as in FIG. 7, it is assumed that die data retneval/ distribution process is 
executed in die nodes #1 to #8 and the processing time in each node is the one 
shown at each of die numbers 300 to 305. ITie processing rime in each node varies 
widi the number of data in each table. The slot sorting process is set so as to be 
executed by die nodes fot join process. When the processing time vanes with each 
node, the processing procedure for transferring die slot sorting process to the nodes 
for data retrieval/distribution is considered. For example, iq a node where the data 
retrieval/distribution process is expected to end earlier as slot sort preptocessiag» the 
slot sorting process is executed as shown at 306 to 309. By performmg the sk>t sort 
preprocessing in this manner, the slot sort processing nme by the nodes for join 
process can be reduced to about the value shown at 312. Using die reduced 
processing rime shown at 311, the N-way merge process is transferred. This is 
nothing but extension of die run letlgdi of die slot sorting process. By doing dus, the 
time 320 requited for the N-way metge ptocess can be reduced and as a result, the 
total response time can be reduced. 

Tsuchida: CoL 7, line 54 - roL 8. line 35 

Next, the method for deciding the number of joiu nodes for performing the 
N-way metge process will be explained with reference to FIG, 7, FIG* 7 is a 
schematic view for explaining the decision method for the number of join nodes. In 
FIG. 7, graphs of the phases of parallel joia process explained in FIG. 3 and of die 
processing time of each process are made and laid out according to the parallel 
pipeline operation explained in FIG. 4. In FIG. 7, it is assiimed that the data 
retrieval/distribution process is executed in the nodes #1 to #8 and the processing 
time in each node is the one shown at each of the numbers 300 to 305. In dus 
eaiample, the processing time 304 in the node #5 is the maximum processing time. 
The 3lot sorting processing time can be driven from the number of nodes for join 
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process N, predetemiined system diaiacteristks (CPU performance, disk unit 
perfottmncc, etc), and database operation method. The performance characteristic 
processing time Es) of the slot sorting process can be obtained generaUy &om the 
foUo'oring expression. 

Es=a/N+b*N+c (1) . . 

The N-way merge processing time (Em) and join ptocessmg tmie (Ej) also 
can be obtained from the following expressions. 

Em=d/N+e*N+f (2) 

Ej=g/N+h*N+i (3) 

where, symbols a, d. and g indicate constants which are decided from sysrem 
charaaeristics such as rhs number of tows, the number of pages, each operation umt 
time, and output time. Symbols b, e, and h are constants which are decided from 
system characteristics such as the communication dnie, and c, f, and i are constants 
which are decided from the Other system characteristics 

According to this embodiment, to maximize the effect of the pipeline 
process, the number of nodes for join process is obtaiaed as the number of assigoed 
join nodes 350 so that dm performance charactierisac Es of die slot sorting process 
^becomes equal to the maximom processing time 304. When die number of assigned 
join nodes 350 is determine4 the N-way mei^e processing time 320 and jom 
processing time 330 can be esdmated from the equations (2) and (3). The tool of 
these processing rimes is die total processing time for a query. By deciding the 
number of join nodes in this manner and merging die data distributed in die data 
reoieval/distribudon process successively and processing diem simultaneously, the 
total processing rime (response rime from querying to output) can be shortened. 
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TsuchidA: FIG 3 




Tsuchida! Col 1 0- line 60 - col. 1 1 . line 49 

FIG. 11(e) shows a detailed flow chart of the process fior generation of 
processing procedure candidates (Step 2213). The process for generation of 
processing procedure candidates checks whether the table to be accessed for the 
query is separately stored in a plurality of nodes (Step 22130). When the table is 
separately stored in a plurality of nodes, die database management syscera goes to 
Step 22135, When the table is not separatd.y stored, the process for generation of 
processing procedure candidates checks whether the sorting process is necessary for 
execuring the query (Step 22131). When the sorting process is necessary for the 
query process, die database management system goes to Step 22135. When the 
sorting process is not necessary for the processing procedure candidates, the process 
for generarion of processing procedure candidates checks whether there is only one 
access path for the table to be accessed for the query (Step 22132). When there is 
only one access path, the process for getietation of processing procedure candidates 
generates a single processing procedure corresponding to the access padi and ends 
die processing (Step 22133), When there is not only one access path the process for 
genecadon of processing procedure candidates generates a plurality of processing 
procedures corresponding to the access paths and ends the processing (Step 22134). 
At Step 22135> the process for generation of processing procedure candidates 
decomposes the query to two-way joins which are joinable. Next, the process fox 
generation of processing procedure candidates generates processing procedure 
candidates for data read on the basis of the registered access path candidates and 
processing procedure candidates for data distribudon according to the 
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decompositioQ result at Step 22135 in correspondence with the storing nodes where 
the table is sep^titely scored. The process for generation of processing procedure 
candidates also generates processing procedure candidates for slot sorring when the 
slot sorting process is to be executed in the storing nodes. The process for 
gencmation of processing procedure candidates registers the processing procedure 
consisting of a combination of these processing procedure candidates as a processing 
procedure candidate in each distribudon node (Step 22136). The process for 
geaeraiion of processing procedure candidates registers the processing procedure 
consisting of a combination of the slot sorting process procedure^ N-way merge 
processing procedure, and join processing procedure as a processing procedure 
candidate in each join node in correspondence with each join processing node. Then, 
the process for generation of processing procedure candidates parameterizes the sloe 
sorting run length and the number of mei^g times (Step 22137). The process for 
generation of processing procedure candidates registers the requested data output 
processing procedure to the requested data output node as a processing procedure 
candidate in the output node (Step 22138). Finally, the process fot generation of 
processing procedure candidates ends the processing when the decomposition 
results are all evaluated and repeats Step 22135 and the subsequent steps when any 
decomposition testJts not evaluated (Step 22139). 



Tfiuchida: FIG llf 
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Tgucliida: Col. 3, linag; 6-47 

Furtheimore. the pluxality of nodes include a.t least ooe decision managenient 
node having an analysis means of receiving a quexy^ analyzing the queiy, and 
generating tib.e qnety processing procedure, a decision means of deciding the 
distribution nodes and join nodes for performing the execution process on the basis 
of the query analysis resxalt of the above analysis means, and an output means of 
Outputring the result for the query obtained from tie join node> The decision means 
o£ lie decision management node desirably decides die distribudon node on die 
basis of die query analysis result of the analysis means, caloilates the expected 
processing time in the distdibudon node, and decides die join node on the basis of 
diis processing time. 

The decision means distributes retrieval informarion eqiially to each join 
node on the basis of the expected retrieval informarion amouat in the decided 
distriburion node. Each of the distribution nodes decided by die decision means 
tetricves information from the storage mean^ on the basis of the query analysis result 
and distributes the information to anodier node. The join node inputs information 
distributed from the distriburion node one by one and processes each inputted 
information. The distriburion node and join node process information 
independently. Each of the join nodes sorts informarion distributed fi:om the 
distribution node, merges the sorted informarion when it consists of a plotality of 
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information types, joins a query on the basis of tte merged information, and outputs 
the result for the query obtained &om the join node. 

To assign retrieval infoomtiDn equally to the join nodes by the decision 
means in a more desirable form, die decision management node has a storage means 
of storing column value frequency information relating to the information of die 
storage means of each node. 

According to the query processing method of the present invention, the 
number of nodes can be decided in correspondence with die database operation 
which is executed in each node. When there is a scattering in distribution of data, the 
data is equally distributed to each node, and each database operation to be executed 
in each node is parameterized, and the expected processing times are equalixed- 
Therefore, the ptocessing rime in each node is not biased and the pipeline operation 
can be performed smoothly. 

Tsuchida: CoL 12. lines 9-^"^ r^cmfllly. coL 12, line 9 - cqL line IQ) 
FIG. 12(b) is a flow chart showing die detailed procedure of die process for 
dynamic optimization (Step 223). The process for dynamic optimiisarion checks 
whether diere is only one processing procedure generated by die process for query. 
When there is only one processing procedure, there is no need to execute the 
process for dynamic optimization and the database management system goes to the 
process for code interpretation execution without doing anything (Step 22300). 
When a plurality of processing procedures are generated by the process for query 
analysis, the process for dynamic optimization calculates die predicate selectivity 
based upon the substimted constant (Step 22301). Then, the process for dynamic 
optimization checks whether processing procedure candidates which are executed in 
parallel by a plurality of nodes are contained (Step 22302). When no conesponding 
processing procedure is contained, the process for dynamic optimization selects die 
processing procedure according to the threshold for access path selection and ends 
the processing ^tep 22313). When a plurahty of processing procedures which are 
executed in parallel are contained, the process for dynamic optimization inputs the 
column value ftequency information (the join column value frequency information, 
die number of rows and the number of ps^es in the table which are to be accessed, 
etc.) from die dictionary (Step 22303) and calculates the processing rime for data 
retdeval/distribution as mentioned above by considering each system characteristic 
(Step 22304), Then, tlie process for dynamic optimization decides the number "p** of 
nodes to be assigned to the join process from the processing time calculated at Step 
22304 and selects the processing procedure "al" for realizing the process explained 
in FIG. 7 from the processing procedure candidates (Step 22305), Next, the process 
for dynamic optimJxation checks whether there is a scattering in die data 
retrieval/distribution processing time in the data retneval/distabution nodes (Step 
22306). When there is a scattering in the data retdeval/distribution processitjg time, 
the process for dynamic optitoization selects the processing procedure "a2" for 
executing the slot sotting process by nodes which can afford to execute the data 
retrieval/distribution process among die data retrieval/di&ttibuaon nodes, that is, for 
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realmng the pioce5$ (ixpkined in FIG. 8 (Step 22307). The process for dynamic 
optiinisation increases the number "p" of assigned join nodes as much "alpha" 
and selects die proc&saing procedure "a3" for realizing the process expbined in FIG. 
9 (Step 22308). Furthermore, the process fot dynamic optitnizarioa compares the 
requested data output processing time with the sum of die join processing time ^d 
the last round of N-way merge processing time and when die former is greater dian 
the latter (Step 22309), selects die processing procedure "a4" for realizing the 
process in which die last round of N-way merge process is transferred to die join 
process as e:cplmned in FIG. 10 (Step 22310). In consideration of die response rime, 
die load of each node, and die effect on the response performance of other 
transacrions, the process for dynamic oprimizarion selects the be$t suited processing 
procedure among the processing procedures *'al" to "a4" which axe set above (Step 

22311) . After the processing procedure is selected, the process for dynamic 
oprimixadon generates the data disttibution information to be used for the data 
dismbudon process on the basis of the column value frequency information (Step 

22312) . When there is no column value ftequency information, the process for 
dynamic optimization generates the data distribution information according to die 
join colunm evaluation value of the hash function. Finally, die process fot dynamic 
opornization deddes die processing procedure which is executed finaUy accord^ 
the threshold for access path selection and the process for dynamic optimization 
ends (Step 22313). 

The descriptions set forth above do not teach or suggest the Umicarion "determining a 
number of load and sort processes to be started in paralH based on die identified memory constraints 
and processiQg capabilities-" 

Instead, Bayer is directed to the scheduling of synchronized tasks, but casks are assigned to 
processors based on precedence and availability, wherein one computation is dependent on another, 
and wherein a processor is allocated a new task immediately after it terminates the previous one. 
Moreover, the "Loading Capadt/' referred to above at coL 14, lines 25-33 of Bayer, as a 
"Characterizing Parameter," relates to the maximal size of die task map, wherein die task map is a 
data stmcture that identified dependcndcs berween casks being performed, and is used as indicated 
to assign casks to processors based on availability. 

Similarly, Tsuchida is directed a parallel processing diitabase management system, wherem 
die number of nodes for a join process is obtained so diat a performance characteristic becomes 
equal to the maximum processing time, ie., a decision means determines which (aheady startec^ 
nodes are to be used to perform die query in order to minimize the expected processing time. 
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As a lesult, aeidier of the Bayei or Tsuchida references teach or suggest "detenmfling a 
aumber of load and sort processes to be smted in paraM based oa d.e identified memory consrcaints 
and processing cap^ibilitics » Consequently, it caimot be said dxat the combination of Bayer md 
Tsuchida teaches or suggests, or renders obvious, the Appellants' independent daims. 

Widi regard to the rejections based on Bhatrachary^ the Office Action States the following: 

5 Claims 1, 20. and 39 are rqected under 35 U.S.C. 103(a) as being 
unpatentable over Bbattachaxyaetal. (US Pat 5.797,000, hcrdnafterBhattachars,a)_ 

6 As per .-l^imc 1, 20. and 39, Bhattachaiy et aL disclose a method ot 
loading data into a data store connected to a computer, the mediod comprising the 

°ldenti^g memory constraints (coL 9, lines 6 - 7, memory becomes a 
constraint as its capacity is a contributing factor and is limited); 

ideniif/ing processing capabilities (fig. 1, number of processors p col. 4. Jme 
41- coL 5, line 8, each processor is assigned widi a specific number of tasks, hence 
indicarinc each limited capability); and , • j 

determining a number of load (col. 3, lines 1 - 3, jom cohirm domain and 
tuples are the load, which obviously much be known in order for tiiem to be 
piririoned and transferred among the duster, coL 3. lines 7 -18), and sort processes 
(coL 2 line 62 - coL 3, line 6 disclose various method of parallel sort process in 
which'metgc join is one example. Since the actual tasks assigned to the processors 
are detennmed during the join phase, which is part of the sort process =^ shown 
above, number of sort processes are hence inherendy determmed as well) to be 
started in parallel based on the identified memory constraints and processing 
capabilities (col. 1 , hnes 24 - 28, coL 4, line 64 - col. S, line 8: parallel tasks based on 
processing capabilities, col. 2, lines 66 - col. 3. line 6: parallel sort processing). 

Appellants' attorney disagrees. The cited portions of the reference do not teach or suggest 
the limitations "determining a number of load and sort processes to be started in parallelbased on the 
identified memory constraints and processing capabilities." 

For example, the cited portions are set fordi below: 



ttl^arT^charya: Col. 9. lines 6-7 (flrmally 6-17) , m r i. 

Each processor 104 of the system 100 is allotted an equal pomon 1/P ot the 
memory capacity of universal cluster 108. In die initial portion of die transfer phase, 
for each processor p (104) of die system 100. die tasks T.5uh.m,p corresponding to 
that processor and residing on a particular cluster 102 are transferred &om chat 
cluster to die universal cluster 108, beginning with die task T.sub.M.p having the 
smallest estimated completion time and progressing in order of mcreaang task size 
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(i.e., decreasing m), unril the aflotted portion 1/P is filled (step 532). The remaining 
tasks T.sub.m,p for esich processor p (104) sire transferred to the duster 102 ov;mng 
the processor J xinless they are akeady resident there (step 534). 



Bhattach ar y^; FIG. 1 



1{tO 



rp m PHMES50RS cp 



103 
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Bhattacharya: Col. 4. line 41 - cqL 5. line 8 

Referring to FIG. 1, a multiprocessor system 100 incoiporating the present 
invention includes P processors 104 organized into Q equal-size clusters 102^ each 
cluster containing P/Q processors. Each processor 104 may be either a uniprocessor 
or a complex of righdy coupled processors (not separately shown) that, for the 
purposes of task assignment, are regarded as a single processor. Each cluster 102 also 
includes one or more direct ^iccess storage devices (DASD) 106> which are magnetic 
disk drives in the system 100 shown. Each processor 104 within a cluster 102 con 
access any storage device 106 in the same cluster, but cannot access any Storage 
device in any other cluster 102. Processors 104 are iuterconnected to one another as 
well as to a single intermediate memory (SIM) 108, to which each processor has 
access. SIM 108 is also referred to herein as the universal cluster, or duster 0. In 
addition to the memory 108 and storage devices 106 shown, each processor 104 abo 
lias its own main memory (not separately shown). In the case of a processor 104 
comprising a tighdy coupled processor complex, such main memory would be 
shared by die processors of the complex. The elements shown in FIG. 1 are 
conventional in die art, as are the interconnections between these elements. 

Processors 104 are used for the concurrent parallel execution of tasks making 
up database queries^ as described bdow. A query may originate either from one of 
the processors 104 or from a separate front-end qu^ty processor as described in the 
concuitendy filed application of T. Borden et aL, Ser. No. 08/148,091, now U.S. Pat, 
No, 5,495,606, As further described in that application, within each duster 102 the 
query splitting and scheduling steps described below may be performed by an 
additional processor or processors (not shown) similar to processors 104; such 
additional processors would not be counted among the P/ Q processors 104 per 
complex 102 to which tasks are assigned. 

Bhattacharva: Col. 3, lines 1^3 (acmally col, 2. line 66 - col. 3. line 6) 
In a parallel sort merge join, the relations to be joined are first sorted, in 
parallel, within theit dusters 102 (FIG 1)- In a naive parallel sort merge join, th<^ 
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underlying join column domain might be partirioned into P ranges of equal size, and 
ttc mples txansferted accordingly among the dusters 102. Howe^^et, giyen a 
noniMiiform distribution of tuples across the underlying domain, there is no 
guarantee that the amount of Join phase work will be equal. 

•Rhan^charyai Col. 3> line-s 7-18 racraallv lines 7-24> 
In accordance with the present invention, each of the P ranges is funhet 
divided into a relatively small number M of components, creating MP tasks 
T.sub.m,p in alL These components intentionally have nonequal task time estimates. 
For exai^ple, a reasonabk approach would be to partition the tasks so that the 
estimated completion time of a task T.sub.m,p is half that of the previous task 
T.sub.m-l,p. Assuming that the quadratic outpiat term dominates the task time 
estimates, d:ai5 can be done by partitioning the tasks in such a manner tliat the extent 
of the range of a given task T.sub.m,p (to which the number of tuples in the task is 
roughly proportional) is l/.sqroot.2 times the number of mples in task T.sub.m-l,p. 
FIGS, lOA and lOB show an example of such a parririoning. FIG. lOA shows 
estimated task times as a function of m and p, and FIG. lOB shows actual task times, 
also as a function of m and p. The latter may be different from the former, and will 
not be known until the join phase, when the tasks are actually performed. 

'Rhart^charya: CoL 1. lines 24-28 

This invention relates generally to a method of performing a parallel query in 
a multiprocessor environment and, more particularly, to a method for performing 
such a query with load balancing in an environment with shared disk clusters, shared 
intemiediate memory or both. 

The dcsctiptions set forth above do not teach or suggest the limitation "determining a 
number of load and sort processes to be started in parallel based on the identified memory constraints 
and processing capabflities/" 

Instead, Bhattacharya is directed to a parallel join operation, wherein tasks are assigned to 
processors based on the parurioning of the domain of tiae join column into P ranges based on the 
number of processors and the partitioning of each range into M subranges based on the estimated 
completion time for the task. 

As a result, Bhattacharya does not teach or suggest the limitations "determining a number of 
load and sort processes to be started in parallel based on the identified memory constraints and 
processing capabilities." Gonsequentiy, it cannot be said tiiat Bhattacharya renders die Appellants' 
independent claims obvious. 
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Hmt2, Bordonaro smd ON fail to overcome the deficieacks of Bhiitracharya. RficaU that 
Umtz was dted only against dependent claims 2-3, 21-22 and 40-41, Bordojiato was cited only 
against dependent claims 4-6, 23-25 and 42-44. and ON was cited only against depende£it claims 7- 
11, 26-30 and 45-49. Moreover, Hintz was cited only for determining a mmber of buUd processes 
based on die number of sort processes, and for teaching that the number of sort processes does not 
exceed a number of indexes to be built, Bordonaro was cited only for teaching diat die number of 
load processes does not exceed a number of partitions to be loaded, and that the load and sort 
processes directly dependent on memory constraints, and ON was cited only for teaching to 
efficiently utilize all processing capabilities required for die desijccd task. None of these teachings are 
relevant to the limitations of Appellants* independent claims. 

Thus, Appellants submit diat independent claims 1, 20 and 39 are allowable over die 
references. 

I. A ppellants^ Dependent Haims Are Patentab le Over The Referenggs 

Dependent claims 2-11, 21-30 and 40-49 are submitted to be allowable over tiae references in 
the same manner, because diey arc dependent on independent claims 1, 20 and 39, respectively, and 
thus contain all the limitations of independent claims 1, 20 and 39. In addition, dependent claims 2- 
6, 21-25 and 40-45 redte additional novel elements not shown by the references. 

With r^ard to claims 2, 21 and 40, which redte "determining a number of build processes 
based on the number of sort processes " the Office Action asserts diat diese limitations are described in 
Hintz at col. 5, lines 50-51. Appellants' attorney disagrees. At the indicated location, Hintx merely 
states that the BUILD routine builds indexes feom the sorted index work file, one index at a time. 

Wiiii rcg&rd to rU^m^ 3, 22 and 41. which icdte that "the maraber of sort processes does not 
exceed a number of indexes to be built," the Office Action asseris diat diese limitations are described in 
Flintz at coL 5, lines 50-51. Appellants' attorney disagrees. At die indicated location, Hint^ merely states 
diat the BUILD routine builds indexes from the sorted index work file, one iadex at a time. 

With regard to claims 4, 23 and 42, which redte that *'the number of load processes does not 
exceed a number of partitions to be loaded," the Office Action alserts that tiiese limitations are 
described in Botdonaro in FIG. 3 (310 and 312), FIG, 2 (202), an col. 4, line 62 coL 6, line 27, md coL 
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5. lines 58-60. Appelknts' attorney disagrees. At the indicated locations, Botdonato merely describes 
pardiioimg sorted Usts into N tasks, and sorting N tasks on N processors, 

Widi regard to daitns 5, 24 and 43, which recite that '*the total number of load and sort 
processes does not exceed processing capabilities;* the Office Action as^^ats diat these limitations are 
described in Bordonato at coL 1, lines 57-68. AppeDants' attorney disagrees. At the indicated location, 
Bordonaro merely describes sorting. 

With regard to daims 6, 25 and 44, which redte that "the memory utilized by die load and sort 
processes does not exceed memory constraints," the Office Action asserts that these limitations are 
described in Bordonaro at coL 5, lines 54-62. Appellants^ attorney disagrees. At the indicated location, 
Bordonaro merely describes the search bounds of the sorted lists read into common memory, 

Witii regard to j-l^imQ 7, 26 and 45, wbidi recite that "die number ofload processes and the 
number of sort processes each requite different processiog power/' tiiese daitns stand or fall with 
independent claims 1, 20 and 39, respectively. 

Widi regard to claims 8, 27 and 46, which redte that '"the number of load processes and the 
number of sort processes each require similar processing power/* these daims stand or &11 with 
independent claims 1 , 20 and 39, respectively. 

With regard to claims 9, 28 and 47, which redte that "the number ofload processes is not equal 
to tiae number of sort processes," diese claims stand or fall with independent daims 1, 20 and 39, 
respectively. 

With regard to daims 10, 29 and 48^ which recite that "the number ofload processes is equal to 
the number of sort processes " these claims stand or fall with independent claims 1, 20 and 39, 
respectivdy. 

With regard to claims 11,30 and 49, which redte that *'the number of load processes is equal to 
the number of sort processes and which is equal to half of the processing capabilities," these daims 
stand or fell widi independent claims 1, 20 and 39, rcspectivdy. 

IX. CONCXUSION 

In light of the above arguments, Appdlants respectfully submit diat the dted references do 
not antidpate nor render obvious the claimed invention. Mote specifically. Appellants' ckims redte 
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novel physical featuies which patentably distingoish over any and all references under 35 U.S-C. §§ 
102 and 103. As a resiat. a d<xision by the Board of Patent Appeals and Intetfctences reversing the 
Examiner and directing allowance of the pending claims m the subject application is respectfully 
solicited. 

Respectfully subtnitted, 

GATES & COOPER ULP 
Attorneys for Appellants 

Howaid Hughes Centex 
6701 Center Drive West, Suite 1050 
Los Angeles, CaUfomia 90045 
(310) -^ai-A™? 



Dare: Aii gu.st 3. 2Q04 

Name: ^ebrge 
Reg. No.: 33,500 

GHG/amb 

O&C 30571 .279-US-Ol 




22 



Dkt: ST9-99-130 



PAGE 25l96*RCVDATO20(l44:16;57PM [Eastern Daylight TimerSVR:USPT0€FXRF-1l2*DNI^ 



OB-03-2004 12:34PM FROW-Gatas & Cooper LLP 



+13106418798 



T-958 P. 026/096 F-098 



APPENDIX 

L A method of loading data into ^ dam store connected to a computer, the method 
comprising the steps o£ 

identifying meuiory constraints; 
identifying processing capabilities; and 

deceroiining a numbct ofload and sort processes to be starred in parallel based on the identified 
memory constraints and processing capabilities. 

2. The method of daim 1, further comprising determining a number of build processes 
based on die nutaber of sort processes. 

3. The method of claim 1, wherem the number of sort processes does not exceed a 
number of indexes to be built 

4. The mediod of claim 1, wherein die number ofload processes does not exceed a 
number of pardtions to be loaded 

5. The method of rh ^ Jv 1 , wherein the total number of load and sort processes does not 
exceed processing capabilities. 

6. The mediod of claim 1 , wherein the memory utilized by the load and sort processes 
does not exceed memory constraints. 

7. The method of daim 1, wherein the number of load processes and the number of sort 
processes each requite different processing power. 

8. The method of claim 1 , wherein the aumber ofload processes and the number of sort 
processes each require similar processing power. 

9. The method of r^^^^ 1 , wherein the number of load processes is not equal to the 
number of sort processes* 
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10. The method of daiml.wherdn the num 
of sort processes. 

11. The jnethod of claim 1 , therein the number of load processes is equal to the number 
of sort processes and which is equal to half of the processing capabilities. 

12. The method of ^^^itn 1, wherein a number of indeites is less than half of the processing 
capabilities, wherein a numbej: of sort processes is equal to die number of indexes, and furdier 
comprising determining that a number of load processes is the smaller of the difference of the 
processing capabilities available for the load processes and the number of sort processes, and a number 
ofpardtioas. 

13. The method of claim 1, wherein a number of partitions is less than half of the 
processing capabilities, wherdn a number of load processes is equal to the number of pardtions, and 
fbrther cotnprising determining that a number of sort processes is die smaller of the difference of the 
processing capabiliries available for the sort processes and the nurxiber of load processes, and a number 
of indexes. 

1 4. The method of rUm l ^ wherein a number of indexes is less than the difference of the 
total amount of available memory and the amoxmt of memory required for a main process, divided by 
the amount of memory required for each load and sort process, wherein a number of sort processes is 
equal to the number of indexes, and furdiet cotnpnsiag determining that the number of load processes 
is the smaller of the difference of a total amount of available memory and tbe amount of memory 
required for the m^iin process, and the amount of memory for each sort process mulriplied by the 
number of indexes, divided by the memory required for each load process, and a number ofpardrions. 

15. The method of daim 1, wherein die number of parddons is less than die difference of 
the total amount of available memory and the amount of memory required for a main process, divided 
by the amount of memory required for each load an sort process, wherein a nutnber of k)ad processes 
is equal to the number of partitions, and further comprising determining that die number of sort 
processes is the smaller of the difference of the total amount of available memory, the amount of 
memory required for the main process, and the amount of memory foi each load process multiplied by 
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the number of partitions, divided by the memory required for each sort process, and a number of 
indexes. 

16. The method of claim 1, wherein a nvimber of load processes is equal to a number of 
sort processtis which is eqwl to the difference of the total amounc of available memory available and 
the amount of memory required for a main process, divided by the amount of memory required for 
each load and SOtt process. 

17. The method of ^1<*i*v> 1, wherein the number of indexes is less than the difference of the 
total amount of available memory, tlie amount of memory required for a main process, and the amount 
of iruOTory required fox each load process multiplied by the processing capabiliries, divided by the 
difference of the amount of memory required for each sort process and each load process, wherein a 
number of sort processes is equal to the number of indexes, and further comprising det ermining drnt 
the number of load processes is the smaUer of the difference of the processing capabilities and the 
number of indexes^ and a number of partitions- 

18. The method of claim 1. wherein the number of partitions is less than the difference of 
the sum of the amount of memory required for each sort process multiplied by die processing 
capabihries, the total arhount of memory required for a main process, and the amount of memory 
required for each load process, divided by the difference of die amount of memory required for reach 
sort process and each load process, wherein a numiber of load processes is equal to die number of 
partitions, and further comprising determining that the number of sort processes is the smaller of the 
difference of the total amount of available memory, the amount of memory required for die main 
process, md the amount of memory for each load process mulripKed by the number of parririonSj 
divided by the memory required for each sort process, and a number of indexes, 

1 9. The method of daim 1 , wherein a number of sort processes is equal to difference of the 
total amount of available memory, die amount of memory required for a main process, and the amoiint 
of memory required for each load process, divided by die difference of the amount of memory required 
for each sort process and each load process, and wherein the number of load processes is equal to die 
difference of the processing capabilities and the number of sort processes. 
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20. An apparatus for executing paxalld load operanon^, comprising: 

a computer having a data stoJCC coupled thereto, wherem the data store stores data; and 
one or more computer programs, performed by the computei;, for idenrifying memory constrams. 
idenrifying processing capahflities, and determining a number of load and sort processes to be started in 
parallel based on the identified mmiory constraints and processing capabilities. 

21. The apparams of daim 20, fmrher comprising detmamdng a number of build processes 
based on die number of sort processes. 

22. The apparatus of 20^ wherein the number of sort processes does not exceed a 
number of iadexes to be built. 

23. The apparatus of claim 20. wberdn the number of load processes does not exceed a 
number of partitioUs to be loaded. 

24. The apparatus of daim 20, wherein the total number of load and sort processes does 
not eacceed processiag capabilities. 

25. The apparatus of daim 20, wherein the memory utilized by the load and sort processes 
does not exceed memory constratats. 

26. The apparatus of claim 20, wherein the number of load processes and the number of 
sort processes each requite different processing power. 

27. The apparatus of ^^^^^ 20, wherein the number of load processes and the numbet of 
. sort processes each require similar processiag power, 

28. The apparatus of daim 20, wherein the number of load processes is not equal to the 
number of sort processes, 

29. The apparams of claim 20, wherein the number of load processes is equal to the 
number of sort processes. 
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30. The apparatus of daim 20, whetdn the nvmh^ of load processes is equal to the 
nuoiber of sort processes which is equal to half of the processing capabilities, 

31. The apparatus of claim 20, wherein a nurnber of indexes is less thanhalf of the 
processiDg capabiUties, wherein a number of sort processes is equal to tlie number of indexes, and 
father comprising determining th^t a number of load processes is die smaller of the difference of the 
processing capabilities available for the load processes and the number of sort processes, and a number 
of partitions. 

32. The apparatus of claim 20, wherein a nxatnber of partitions is less than half of the 
processing capabilities, wherein a number of load processes is equal to the number of partitions, and 
fbxdiei comprising detemiining diat a number of sort processes is die smaller of the difference of die 
processing capabilities available for the sort processes and the number of load processes, and a number 
of indexes. 

33. The appatams of daim 20, wherein a number of indexes is less than the difference of 
die total amount of available memory and the amount of memory required for a main process, divided 
by die amount of memory required for each load and sort process, wherein a number of sort processes 
is equal to the number of indexes, and further comprising determining that die number of load 
processes is the smaller of the difference of a total amount of available memory and die amount of 
memory required for die main process, and the arnount of memory for each sort process multiplied by 
the number of indexes, divided by the memory required for each load process, and on a number of 
partitions. 

34. The apparatus of claim 20, wherein the number of partirions is less than die difference 
of die total amount of available memory and the amount of memory required for a main process, 
divided by the amount of memory required for each load and sort process is equal to the number of 
partitions, and fiuther comprising determining that the number of sort processes is the smaller of die 
difference of the total amount of available memory, the amount of memory required for the main 
process, and die amount of memory for each load process multiplied by die number of partitions, 
divided by die memorj^ required for each sort process, and a number of indexes. 
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35. The apparatus of cW 20, wheicin a number of load processes is equal to a number of 
sort processes which is equal to the difference of the total amount of available memory available and 
the amount of memory required for a mdn process, divided by the amount of memory required for 
each load and sort process. 

36. The apparams of daim 20, wherein the number of indexes is less than the difference of 
the total amount of available memory, the amount of memory required for a main process, and the 
iimount of meiTiory required for each load process multiplied by die processing capabilities, divided by 
the difference of die amount of memory required for each sort process and each load process, wherdn 
a number of sort processes is equal to the number of indexes, and further comptismg determining that 
die ntimber of load processes is the smaller of the difference of die processing capabilities and die 
number of indexes and, a number of partitions. 

37. The apparatus of daim 20, wherein die number of partitions is ks$ than the difference 
of the sum of the amount of memory required for eadi sort process multiplied by die processing 
capabiKties, the total amount of memory required for a main process, and die amount of memory 
requited for each load process, divided by die difference of the amount of memory required for each 
sort process and each load process, wherein a number of load processes is equal to the number of 
partitions, and further comprising determining that die number of sort processes is the smaller of die 
difference of the total amount of available memory, die amount of memory required for the main 
process, and die amount of memory for each load process multiplied by die n^ambet of partitions, 
divided by the memory required for each sort process, and a number of indexes, 

38. The apparatus of daim 20, wherdn a number of sort processes is equal to difference of 
the total amount of available memory, the amount of memory required for a toain process, and the 
amount of memory required for each load process, divided by the difference of die amount of memory 
required for each sort process and each load process, and wherein die number of load processes is 
equal to the difference to the processing capabilitiea and the number of sort processes. 
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39. An article of manufacture comprising a program storage medium readable by a 
computer and embodying one or more insttactions executable by the computer to petform method 
steps for loadidg data into a data store connected to a computer, the method comprising the steps o£ 

identifying memory constraints; 
identifying processing capahilitiesj and 

determining a number of load and sort processes to be started in parallel based on the identified 
memory constraints and processing capabiliries. 

40. The article of tnanufecture of daim 39, further comprising determining a number of 
build processes baaed on fhe number of sort processes. 

41. The article ofmariufactwreofclflim 39, wherein the number of sor^ 
exceed a number of indexes to be built 

42. The article of manufacture of claim 39, wherein the number of load processes does not 
exceed a number of partitions to be loaded. 

43. The article of mannticture of claim 39, wherein the total number of load and sort 
processes does not exceed processing capabiliries. 

44. The article of manuficture of daim 39, wherein the memory utilized by the load and 
sort processes does nor exceed raemoiy constraints. 

45. The aiticic of manufacture of daim 39> wherein the number of load processes and the 
number of $ort processes each require difiEerent processing power* 

46. The ardde of manufecture of daim 39, wherein the number of load processes and tfhe 
number of sort processes each requite similar processing power. 

47. The ardde of manufacture of daim 39, wherein the number of load processes is not 
equal to the number of sort processes. 
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48. The arride of imnufecture of claim 39, wherein the aumbei of load processes is equal 
to the aiamber of sort processes. 

49. The Betide of manu&cmre of daim 39, wheiein the number of load processes is equal 
to the number of sort processes which is equal to half of the ptocessing capabilities, 

50. The artidc of manufacture of daim 39, wherein a number of indexes is less than half of 
the processing capabilities, wherein a number of sort processes is equal to die number of sort processes 
is equal to the number of indexes, and further con^rising, determining thzz a number of load processes 
is die smaller of the difference of the processing capabilities available for the load processes and tbe 
number of sort processes, and a number of pamtloios, 

51 . The artidc of manufacture of daim 39, whetdn a number of partitions if less tb^ half 
of dac processing capabilities, wherein a number of load processes is equal to the numbec of parUtions, 
and further comprising determining that a number of sort processes is the smaller of the difference of 
die processing capabilities available for the sort processes and the number of load processes, and on a 
number of indexes. 

52. The artide of manu&ctuie of claim 39, wherein 0. number of indexes is less than die 
difference of the total amount of available memory and the amount of memory required for a main 
process, divided by the amount of memory required for each load and sort process^ wherein a number 
of sort processes is equal to the number of indexes, and fiirther comprising detetmimng that the 
number of load processes is the smaller of the difference of a total amount of available memory and the 
amount of memory required for die main process, and the amount of memory for each sort process 
multiplied by the number of indexes, divided by die memory required for each load process and, a 
number of partitions. 

53. The artide of manufacture daim 39, wherein the number of partitions is less than d»e 
difference of the total amount of available memory and the amount of memory required for a main 
process, divided by the amount of memory required for each load and sort process, wherein a number 
of load processes is equal to the number of pardnons, and further comprising determining that the 
nvunber of sort processes is die smaller of die difference of the total amount of available memory, the 
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amount of memoiy required for the maiix process, and die amount of memotjr for each load process 
multiplied by the number of partitions, divided by the memory required for each sort process, and a 
number of indexes. 

54. The article of manufacture of claim 39, wherein a number of load processes is equal to 
a number of sort processes which is equal ro the difference of die total amount of avaikble memory 
available and the amount of memory required for a main process, divided by the amount of memory 
required for each load and sort process. 

55. The axtidc of manufiicmre of ^"^^m 39, wherein the number of indexes is less than the 
difference of the total amount of available memoryj the amount of memory required for a main 
process, and the amount of memory required for each load process mult^licd by the processing 
capabiliues, divided by the difference of die amount of memory required for each sort process and each 
load process, wherein a nijtnbet of sort processes is eqi!ial to the number of indexes, and further 
comprising determining that die number of load processes is the smaller of the difference of die 
processing capabilities and the number of indexes and, a number of partitioas. 

56. Hie axdde of manufacture of f]<^im 39, wherein the number of parddons is less than the 
difference of die sum of the amount of memory required for each sort process multiplied by the 
processing capabilities, the total amount of memory required for a main process, and the amount of 
memory required for each bad process, divided by the difference of the amount of memory required 
for each sort process and each load process, wlierem a number of load processes is equal to die number 
of partitions, and further comprising determining that die number of sort processes is the smaller of the 
difference of the total amoxmt of available memory, die amount of memory required for the main 
process, and the amount of memory for each load process multiplied by the number of partitions, 
divided by the memory requited for each sort process, and a number of indexes. 

57. The article of manufectuxe of daim 39, wherein a number of sort processes is equal to 
difference of the total amotint of available memory, die amount of memory required for a main 
process, md the amomit of memory required for each load process, divided by die difference of tlie 
amount of memory required for each sort process and each load process, and wherein die niombct of 
load processes is equal to the difference of the processing capabilities and the number of sort processes, 
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